From one star to three stars: Upgrading legacy open data using crowdsourcing

Abstract

Despite recent open data initiatives in many countries, a significant percentage of the data provided is in non-machine-readable formats, such as images, rather than in a machine-readable electronic format, which restricts its usability. This paper describes the first unified framework for converting legacy open data in image format into a machine-readable and reusable format by using crowdsourcing. Crowd workers are asked not only to extract data from an image of a chart but also to reproduce the chart objects in spreadsheets. The properties of the reconstructed chart objects give their data structures, including series names and values, which are useful for automatic processing of the data by computer. Since results produced by crowdsourcing inherently contain errors, a quality control mechanism was developed that improves the accuracy of extracted tables by aggregating tables created by different workers for the same chart image and by utilizing the data structures obtained from the reproduced chart objects. Experimental results demonstrated that the proposed framework and mechanism are effective.
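
The abstract does not spell out how tables from different workers are aggregated. As a rough illustration only, the sketch below shows one simple way such an aggregation could work: a cell is kept if a majority of workers reported it, and its value is the median of what they reported. The function name `aggregate_tables`, the `min_support` parameter, the `(series_name, category)` key layout, and the sample data are all assumptions made for this example, not the paper's actual mechanism.

```python
from collections import Counter
from statistics import median


def aggregate_tables(worker_tables, min_support=0.5):
    """Combine several workers' tables into one consensus table.

    Hypothetical layout: each table maps (series_name, category) -> number.
    A cell is kept only if more than `min_support` of the workers reported
    it; its consensus value is the median of the reported values.
    """
    if not worker_tables:
        return {}
    n_workers = len(worker_tables)
    # Count how many workers reported each cell.
    support = Counter(key for table in worker_tables for key in table)
    consensus = {}
    for key, count in support.items():
        if count / n_workers > min_support:
            values = [table[key] for table in worker_tables if key in table]
            consensus[key] = median(values)
    return consensus


# Made-up example: three workers transcribe the same chart image.
tables = [
    {("Sales", "2010"): 120, ("Sales", "2011"): 135},
    {("Sales", "2010"): 120, ("Sales", "2011"): 153},  # transposed digits
    {("Sales", "2010"): 120, ("Sales", "2011"): 135, ("Sales", "2012"): 9},
]
result = aggregate_tables(tables)
print(result)
```

In this made-up input, the median suppresses the single transposed value, and the cell reported by only one worker falls below the support threshold and is dropped. A mechanism that also uses the data structures from the reproduced chart objects, as the abstract describes, would need more than this cell-wise vote.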
